AQL: an Alignment Based Language for Querying String Databases

نویسندگان

  • Gösta Grahne
  • Raul Hakli
  • Matti Nykänen
  • Esko Ukkonen
چکیده

AQL stands for Alignment Query Language, and it is an extension of SQL. AQL allows for declarative querying of databases containing strings of characters as entries. For instance, in applications such as genomic databases, the DNA-sequences are strings, and they need to be queried based on their structure. The LIKE-operation in SQL does not go very far. Existing systems for string databases have a very ad hoc nature, usually giving the user a set of predefined simple query forms. On the other hand, there has recently been considerable theoretical advancement in the field of string databases. Several languages have been proposed, and their expressive power is becoming illuminated. However, there has been no practical demonstration of the feasibility of these languages. In this paper we describe an implementation of Alignment Calculus, a powerful string database language based on a modal logic extension of relational calculus. The implementation yields AQL. In AQL the user can write declarative string predicates in the WHERE-clause of a query. The predicates can be of an arbitrary arity, expressing for instance a unary property of a string being a palindrome, or expressing a binary property of one string being the reversal on another, and so on. The string predicates are translated into a novel form of a high level finite state automaton, and they are evaluated using an inference engine for chasing these automata. The AQL system has been implemented as an extension of a relational database management system. Our experiments show that the inference engine has the efficiency of an equivalent query computed in compiled Prolog. This is efficiency achieved already without using optimisation, something which the string queries due to their declarative nature are amenable to.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Safety, Translation and Evaluation of Alignment Calculus

In this paper we pursue the study of Alignment Calculus, a declarative string database query language that supports both string querying and restructuring. This language is aimed for applications such as molecular biology databases, where the basic data type is a string, and the queries are combinatorial in nature. The declarative nature of our language does, however, require some additional ef...

متن کامل

Sequence Datalog: Declarative String Manipulation in Databases

We investigate logic-based query languages for sequence databases , that is, databases in which strings of symbols over a xed alphabet can occur. We discuss diierent approaches to querying strings, including Prolog and Datalog with function symbols, and argue that all of them have important limitations. We then present the semantics of Sequence Datalog, a logic for querying sequence databases, ...

متن کامل

Similarity matching of continuous melody contours for humming querying of melody databases

Music query-by-humming is a challenging problem since the humming query inevitably contains much variation and inaccuracy. Many of the previous methods, which adopt note segmentation and string matching with dynamic programming, suffer drastically from the errors in the note segmentation which affects retrieval accuracy and efficiency. In this paper, we present a novel melody similarity matchin...

متن کامل

Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information

With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...

متن کامل

graphANNIS: A Fast Query Engine for Deeply Annotated Linguistic Corpora

We present graphANNIS, a fast implementation of the established query language AQL for dealing with deeply annotated linguistic corpora. AQL builds on a graphbased abstraction for modeling and exchanging linguistic data, yet all its current implementations use relational databases as storage layer. In contrast, graphANNIS directly implements the ANNIS graph data model in main memory. We show th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998